New Trends in Classification and Data Mining
نویسندگان
چکیده
The observation of the lexical structure of the Bulgarian folklore is very important task for different science domains such as folkloristic, ethnology, linguistics, computational linguistics, Bulgarian language history, etc. Until today, such a linguistic analysis hasn’t been made; it is unclear what is the lexical structure of Bulgarian folklore works. First attempt for computational lexical analysis of the Bulgarian folklore and its constituents is made during the "Knowledge Technologies for Creation of Digital Presentation and Significant Repositories of Folklore Heritage" 1. During the project the Bulgarian folklore digital library (BFDL) is designed and developed. In its structure it is implemented linguistic components, whose aim is the realization of different types of analysis of folk objects from a text media type. Thus, we lay the foundation of the linguistic analysis services in digital libraries aiding the research of kinds, number and frequency of the lexical units that constitute various folk objects. This paper presents basic types of dictionaries needed to carry out such linguistic analysis. It describes the BDFL Linguistics Search in sets of folklore objects of text media type and a linguistic component for frequency analysis of the folklore vocabulary. Finally, a project for implementation of a dictionary concordances of songs, prose, interviews, etc. is outlined.
منابع مشابه
Topic Modeling and Classification of Cyberspace Papers Using Text Mining
The global cyberspace networks provide individuals with platforms to can interact, exchange ideas, share information, provide social support, conduct business, create artistic media, play games, engage in political discussions, and many more. The term cyberspace has become a conventional means to describe anything associated with the Internet and the diverse Internet culture. In fact, cyberspac...
متن کاملA New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining
Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...
متن کاملThe application of data mining techniques in manipulated financial statement classification: The case of turkey
Predicting financially false statements to detect frauds in companies has an increasing trend in recent studies. The manipulations in financial statements can be discovered by auditors when related financial records and indicators are analyzed in depth together with the experience of auditors in order to create knowledge to develop a decision support system to classify firms. Auditors may annot...
متن کاملS3PSO: Students’ Performance Prediction Based on Particle Swarm Optimization
Nowadays, new methods are required to take advantage of the rich and extensive gold mine of data given the vast content of data particularly created by educational systems. Data mining algorithms have been used in educational systems especially e-learning systems due to the broad usage of these systems. Providing a model to predict final student results in educational course is a reason for usi...
متن کاملCustomer Retention Based on the Number of Purchase: A Data Mining Approach
Purpose: this study wants to find any relationship between the numbers of purchase and the income the customer brings to the company. The attempt is to find those customers who buy more than one life insurance policy and represent the signs of good payments at the same time by the help of data mining tools. Design/ methodology/ approach: the approach of this research is to use data mining tools...
متن کاملEmerging Trends in Associative Classification Data Mining
Utilising association rule discovery to learn classifiers in data mining is known as Associative Classification (AC). In the last decade, AC algorithms proved to be effective in devising high accurate classification systems from various types of supervised data sets. Yet, there are new emerging trends and that can further enhance the performance of current AC methods or necessitate the developm...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010